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The delta method is a popular and elementary tool for deriving 
00 I limiting distributions of transformed statistics, while applications of 

asymptotic distributions do not allow one to obtain desirable accu- 
racy of approximation for tail probabilities. The large and moderate 
deviation theory can achieve this goal. Motivated by the delta method 
in weak convergence, a general delta method in large deviations is pro- 
posed. The new method can be widely applied to driving the moder- 
, ate deviations of estimators and is illustrated by examples including 

' the Wilcoxon statistic, the Kaplan-Meier estimator, the empirical 

quantile processes and the empirical copula function. We also im- 
prove the existing moderate deviations results for M-estimators and 
L-statistics by the new method. Some applications of moderate de- 
viations to statistical hypothesis testing are provided. 

> " 

• 1. Introduction. Consider a family of random variables {Yn,n > 1} such 

! as the sample mean. Assume that it satisfies a law of large numbers and 

' a fluctuation theorem such as central limit theorem, that is, 1^—7-0 in law 

. and there exists a sequence 6„ — )• oo such that bn{Yn — 9)^Y'm law, where 9 

is a constant and 1" is a nontrivial random variable. A large deviation result is 
concerned with estimation of large deviation probabilities -P(|^ — ^| > e) for 
e > 0. A moderate deviation result is concerned with estimation of large de- 
viation probabilities P(r„|y„ — ^| > e) for e > 0, where r„ is an intermediate 
^ . scale between 1 and that is, r„ — t- oo and bn/rn — )• oo. In particular, if 

^1 hn = then r„ = n^/'^~^ with < 5 < 1. 
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The large deviation and moderate deviation problems arise in the theory 
of statistical inference quite naturally. For estimation of unknown parame- 
ters and functions, it is first of all important to minimize the risk of wrong 
decisions implied by deviations of the observed values of estimators from the 
true values of parameters or functions to be estimated. Such gross errors are 
precisely the subject of large deviation theory. The large deviation and mod- 
erate deviation results of estimators can provide us with the rates of conver- 
gence and a useful method for constructing asymptotic confidence intervals. 
For the classical large deviation theory with the empirical measures and 
sample means, one can refer to Sanov (1957), Groeneboom, Oosterhoff and 
Ruymgaart (1979) and Bahadur and Zabell (1979). The large deviations for 
linear combinations of order statistics (L-estimators) were also investigated 
in Groeneboom, Oosterhoff and Ruymgaart (1979). Bahadur and Zabell 
(1979) developed a subadditive method to study the large deviations for 
general sample means. For some developments of large deviations and mod- 
erate deviations in statistics, see Fu (1982), Kester and Kallenberg (1986), 
Sieders and Dzhaparidze (1987), Inglot and Ledwina (1990), Borovkov and 
Mogul'sku (1992), Puhalskn and Spokoiny (1998), Bercu (2001), Joutard 
(2004) and Arcones (2006) for large deviations of estimators; Kallenberg 
(1983), Gao (2001), Arcones (2002), Inglot and Kallenberg (2003), Djell- 
out, Guillin and Wu (2006) and Ermakov (2008) for moderate deviations of 
estimators; Louani (1998), Worms (2001), Gao (2003), Lei and Wu (2005) 
for large deviations and moderate deviations of kernel density estimators, 
and references therein. On the other hand, large deviations of estimators 
can be applied to Bahadur efficiency to determine the Bahadur slope [Ba- 
hadur (1967), Nikitin (1995), He and Shao (1996)] and hypothesis testing 
[see Dembo and Zeitouni (1998), Sections 3.5 and 7.1]. 

In statistics, many important estimators are functionals $(Ln) of the em- 
pirical processes L„, and so deriving limiting distribution of r„(<I>(L„) — 
$(//)) from limiting distribution of rn{Ln — /^) is a fundamental problem, 
where r„ is a sequence of positive numbers and ^ is the mean of L„. It is well 
known that the delta method is a popular and elementary tool for solving the 
problem. The method tells us that the weak convergence of rn{Xn — 0) yields 
the weak convergence of rn{^{Xn) — ^*(^)) if *^ is Hadamard differentiable 
(see Section 3), where Xn is a sequence of random variables, is a constant 
and r„ — )• oo. For some developments and applications of the delta method, 
one can refer to Gill (1989), Kosorok (2008), Reeds (1976), and van der 
Vaart and Wellner (1996) among others. For example, Reeds (1976) system- 
atically developed the use of Hadamard instead of Frechet differentiability 
to derive asymptotic distributions of transformed processes. Andersen et al. 
(1993) also described some applications of the delta method in survival anal- 
ysis. More recently, van der Vaart and Wellner (1996) and Kosorok (2008) 
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provided an excellent summary of the functional delta method in terms of 
a weak convergence. 

A natural problem is whether the large deviations of r.„(^>(X„) — ^(9)) 
can be obtained from the large deviations of — 6) if the function <1> 

defined on a set is Hadamard differentiable. When rn = r for all n with 
a constant r, the problem can be solved by the contraction principle [see 
Dembo and Zeitouni (1998)]. When — )■ oo, for each n > 1, define P„ = 
{h;e + h/vn G and = r„($(^ + h/r^) - HO)) for all h £ P„. 

Then by Hadamard differentiability, for every sequence /i„ G T>n converging 
to h, the sequence /„ satisfies /n(^n) — ^ ^'ei^)- Note that fni^niXn — 0)) = 
rni^{Xn) — ^{0))- Motivated by this, we can also consider to use a con- 
traction principle for establishing the large deviations of r„(^>(X„) — $(^)). 
However, the existing contraction principles cannot be applicable to these 
situations as addressed in Remark 2.1 of next section. For this reason, we 
need to extend the contraction principle in large deviations. 

The objective of this paper is to develop a general delta method in large 
deviations similar to that in week convergence and applies the method to 
solve some moderate deviation problems in statistics. The remainder of 
the paper is organized as follows. In Section 2, we present an extended 
contraction principle, while its proof will be given in the Appendix. Then 
a general delta method in large deviations is established by using the ex- 
tended contraction principle in Section 3. In Section 4, we apply the pro- 
posed delta method in large deviations to some statistical models including 
censored data, empirical quantile process, copula function, M-estimators 
and L-statistics. The moderate deviation principles for the Wilcoxon statis- 
tic, the Kaplan-Meier estimator, the empirical quantile estimator and the 
empirical copula estimator are established. We also improve the existing 
moderate deviation results for M-estimators and L-statistics in Section 4, 
where our proofs are different from others but more simple by the new 
method. Section 5 presents some applications of the moderate deviation re- 
sults to statistical hypothesis testing. Some concluding remarks are made in 
Section 6. 

2. An extended contraction principle. As explained in previous section, 
to establish a delta method in large deviation, we first need to generalize the 
contraction principle in large deviation theory. In this section, we present 
an extension of the contraction principle which plays an important role. 

First, let us introduce some notation in large deviations [Dembo and Zei- 
touni (1998), Deuschel and Stroock (1989)]. For a metric space X, 13{X) is 
the Borel cr-algebra of X. Let {Q,J^,P) be a probability space and let T be 
an arbitrary map from to M, where M = [— oo, oo] is the space of extended 
real numbers. The outer integral of T with respect to P is defined by van 
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der Vaart and Wellner (1996) 

^*(r) = mf{£;(C/);C/>r,C/:f]h^l measurable and E{U) exists}. 
The outer probability of an arbitrary subset S of 17 is 
P*{B) = M{P{A);A DB,AeT}. 
Inner integral and inner probability are defined by 

E^{T) = -E*{-T) and P^B) = 1 - P* {n\ B), 

respectively. 

Let {{Q,n,J^n,Pn),n > 1} be a sequence of probability spaces and let 
{Xn,n > 1} be a sequence of maps from Qn to X. Let {X{n),n > 1} be 
a sequence of positive numbers tending to +oo and let [0,+oo] be 

inf-compact; that is, [I < L] is compact for any L £ M. Then {Xn,n > 1} is 
said to satisfy the lower bound of large deviation (LLD) with speed A(n) 
and rate function I, if for any open measurable subset G of X, 

(2.1) /(G) :=liminf-^logP„,(X„ £ G) > - inf I{x). 

{Xn, n > 1} is said to satisfy the upper bound of large deviation (ULD) with 
speed A(n) and rate function I, if for any closed measurable subset F of X, 

(2.2) ^/(F):=limsup-^logP„*(X„GF)<- inf /(x). 

We say that {X„,n > 1} satisfies the large deviation principle (LDP) with 
speed A(n) and rate function /, if both LLD and ULD hold. 
Now, we present the extended contraction principle. 

Theorem 2.1 (Extended contraction principle). Let {X,d) and {y,p) 
be two metric spaces. Let {T>ri,n > 1} be a sequences of subsets in {X,d), 
and let {/„ : Vn ^ y;n £ N} be a family of mappings. Also for each n>l, 
let Xn be a map from probability space (il„, P„) to P„. Suppose that: 

(i) {Xn,n > 1} satisfies the large deviation principle with speed \{n) and 
rate function I ; 

(ii) there exists a mapping f -.{I < oo} i— )• y such that if for a sequence 
{xn G T^njU > 1}, Xn ^ X G {I < oo} as u ^ OO, then fn{xn) — ^ f{x) as 
n — oo. 

Then {fniXn),n> 1} satisfies the large deviation principle with speed A(n) 
and rate function If, where 

(2.3) If{y) = mf{I{xy,f{x)=y}, yey. 



DELTA METHOD IN LARGE DEVIATIONS 



5 



The proof of the theorem is given in the Appendix. 

Remark 2.1. (1) If Vn = X for aU n > 1, then Theorem 2.1 yields 
Theorem 2.1 in Arcones (2003b). Another popular contraction principle was 
given in Theorem 4.3.23 of Dembo and Zeitouni (1998), in which Dn = ^ 
for all n > 1, /„ is continuous for all n > 1 and for any L € (0, oo), 

(2.4) hm sup p{fn{x)Jix))=0. 

"^°°x:7(x)<L 

This condition cannot be compared to condition (ii) in Theorem 2.1. 

(2) It is necessary for proving Theorem 3.1 to introduce the sequence of 
subsets T>n in Theorem 2.1, because subsets {h€ X;9 + h/vn S P$},n > 1 
are not equal, generally, ioi 6 ^ X and a subset of a topological linear 
spaces X . In fact, 2?$ is usually a subset of X in applications (see Section 4). 

3. Delta method in large deviations. In this section, we establish a delta 
method in large deviations by using the extended contraction principle pre- 
sented in Section 2. 

Let us first recall some conceptions of Hadamard differentiability [Gill 
(1989), van der Vaart and Wellner (1996), Kosorok (2008), Romisch (2005)]. 
Let X and 3^ be two metrizable topological linear spaces. A map ^ defined 
on a subset P$ of X with values in 3^ is called Hadamard differentiable at x 
if there exists a continuous mapping ^'^:X such that 

(3.1) limi(^±^IiM^^ 

n— >oo tn 

holds for all sequences converging to 0+ and converging to /i in 
such that X + tnhn £ for every n. 

Remark 3.1. Linearity of the Hadamard directional derivative $^(-) is 
not required. In fact, $^(-) is often not linear if $ is given by inequality 
constraints. However, by the definition, we can see that $^(-) is positively 
homogenous; that is, $^(t/i) = t^'^{h) for all t > and h£ X. 

The definition of the Hadamard differentiable may be refined to Hadamard 
differentiable tangentially to a set Dq C X. For a subset Pq of the map <I> 
is said to be Hadamard differentiable at x € T>^ tangentially to T>q if the 
limit (3.1) exists for all sequences tn converging to 0+ and hn converging 
to h in Dq such that x + tnhn S 2?$ for every n. In this case, the Hadamard 
derivative '^^(•) is a continuous mapping on Vq. If Vq is a cone, then $^(-) 
is again positively homogenous. 
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Theorem 3.1 (Delta method in large deviation). Let X and y he two 
metrizable linear topological spaces and let d and p he compatible metrics on 
X and y , respectively. Let $ (ZX ^y he Hadamard-differentiahle at 6 
tangentially to Dq, where T)^ and Pq oif^ two suhsets of X . Let X„:r2.„i— )■ 
T>^,n >1 he a sequence of maps and let rn,n > 1, he a sequence of positive 
real numhers satisfying Vn — )• +oo. 

If {r„(X„ — 0),n > 1} satisfies the large deviation principle with speed 
\{n) and rate function I and {I < oo} C Vq , then {rn{^{Xn) — ^{0)),n>l} 
satisfies the large deviation principle with speed A(n) and rate function 
where 

(3.2) I^,{y) = mf{I{x);'P'e{x) = y}, yey. 

Furthermore, if ^'g is defined and continuous on the whole space of X , 
then {rn{^{Xn) — ^{6)) — ^^{rniXn — 9)),n> 1} satisfies the large deviation 
principle with speed A(n) and rate function 



0, z = 0, 

+00, otherwise. 



(3.3) U,eiz) ■■ 

In particular, for any 5 > 0, 

Ihnsnp -^log P*{p{rnmXn) - m) " '^'eirn{Xn - 0)),O) > 6) 

(3.4) 

= — oo. 

Proof. For each n > 1, define P„, = {h £ X;9 + h/rn G Vq,} and 

f,,:Vn^y, fn{h)=rn{He + h/rn)-He)) for all he Vn. 

Then for every sequence /i„ G P„ converging to h^Vo, the sequence satis- 
fies fn{hn) ^'o{h)- In addition, ^'q{-) is continuous on Vq. Therefore, Theo- 
rem 2.1 implies that 

{rniHXn) - He)),n> 1} = {fn{rn{Xn " e)),n> 1} 

satisfies the large deviation principle with speed A(n) and rate function I^/^. 

Now, we consider the mapping (pn :Pn i— ^ 3^ x 3^, where (pn{h) = {fn{h), 
$g(/i)) for all h G If <^g(-) is continuous on X, then for every subsequence 
hn' G V n' converging to /i G X, ipn'{hn') converges to ($g(/i), ^'^(/i)). Hence, 
Theorem 2.1 implies {y3„,(r„(X„ — 0)),n > 1} satisfies the large deviation 
principle with speed A(n) and rate function 

^$,e(yi,y2) =inf{/(x);$e(x) = yi = y2}, (2/1,2/2) G 3^ x 3^. 

Therefore, by the classical contraction principle [see Dembo and Zeitouni 
(1998), Theorem 4.2.1], we conclude that the difference 

{rniHXn) - H0)) - ^'e{rn{Xn - 0)),n > 1} 
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satisfies the large deviation principle with speed A(n) and rate function 

mi{J^^e = {yi,y2);yi-y2 = z} = h^e{z) iorz^y. □ 

4. Moderate deviations of estimators. In this section, moderate devia- 
tion principles for some estimators will be established by applying the delta 
method in large deviation to Wilcoxon statistic, Kaplan-Meier estimator, 
the empirical quantile processes, M-estimators and L-statistics. 

Let us introduce some notation. Given an arbitrary set T and a Banach spa- 
ce (B, II ■ ||b), the Banach space looiTjM) is the set of all maps z:T h^M that 
are uniformly norm-bounded equipped with the norm ||z|| = sup^g^" II-2(*)I|b- 
Let loo(T) be the Banach space of all bounded real functions x on T, 
equipped with the sup- norm ||x|| = sup^gj^ |x(t)| . It is a nonseparable Banach 
space if T is infinite. On /oo(^)) we will consider the u-field B generated by 
all balls and all coordinates x{t),t G T. 

Let {S,d) be a complete separable and measurable metric space and let 
bS be the space of all bounded real measurable functions on {S,S) where S 
is the Borel u-algebra of S. Let {X, Xn,n > 1} be a sequence of i.i.d. random 
variables with values in S" on a probability space {i},J-,P), of law /u. Let L„ 
denote the empirical measures; that is, 

1 " 

Ln = -'S^Jx,, n>l. 

1=1 

For given a class of functions ^ CbS, let looid) be the space of all bounded 
real functions on 5 with sup-norm ||-F||5 = supjgj |-F(/)|. This is a Banach 
space. Every i/ G Mf,{S) [the space of signed measures of finite variation on 
(5,5)] corresponds to an element = v{f) = f / dz^ for all / G 5^. 

Let D[a,b] denote the Banach space of all right continuous with left-hand 
limits functions z : [a, 6] i— t- M on an interval [a, 6] C M equipped with the 
uniform norm. Let BV[a, b] denote the set of all cadlag functions with finite 
total variation and set BVM[a,b] = G BV[a,b]; J \dA\ < M}, where the 
notation J \ dA\ denotes the total variation of the function A. In this article, 
we also let {a„ = a(n),n > 1} be a sequence of real numbers such that as 
n — )• oo, 

a„ — )■ oo and an/y/n^O. 

4.1. Moderate deviations for Wilcoxon statistic. Let Xi, . . . ,Xm and Yi, 
■ ■ ■ ,Yn be independent samples from distribution functions F and G on M, 
respectively. If and Gn are the empirical distribution functions of the 
two samples; that is, 

^ m 1 

Fm{x) = — J x,{i-oo,x]) and G„(x) = - (5y. ((-oo, x]), 

i=\ 1=1 
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then the Wilcoxon statistic is defined by Wm,n = f Fm dGn- It is an estimator 
oiP{X<Y). 

Theorem 4.1. Assume that m / [m + n) ^ as m,n ^ oo. Then 



(4.1) {:p^^±^{ iF^dG^- lFdG],n>l 



a{mn/{m + n)) 

satisfies the LDP in M with speed a"^ {mn / {m + n)) and rate function 



defined by 

(4.2) I^ix) 



rW ( ^ ^ 



2(AVar(F(y)) + (1 - A) Var(G(X))) ' 

Proof. Applying Theorem 2 of Wu (1994) to = \ Yll=i ^x,, di = 
{{-oo,x];x eR}, and = ^_Yli=i^Yi, ^2 = {{-oo,y];y eR}, respectively, 
and using the product principle in large deviations [Dembo and Zeitouni 

(1998)], we obtain that {^(F„ - F, G„ - G), n > 1} satisfies the LDP in 
/oo(I^) X ^oo(]R) with speed and rate function {/^^(a) + /g(/3)}, where 



/F(a) = inf|i j ^\x)dF{, 



j{x)dF{x) = 0,a{t)= / j{x)dF{x) 

J{~oo,t] 

for each t G R, 7 : M — t- M is measurable 




a'p{x)\'^ dF{x), ifa<Fand lim \a{t)\=0, 
otherwise 

and a'p = da/dF. Since ^ (1 - A)!/^ and > X^/^, 

y/'mn/{m+n) Wmn/(m+n) 

then 

^/ mn/{m + n) 



iFm-F,Gn-G),n>l 



a{mn/{m + n)) 

satisfies the LDP in /oo(IK) x ^00 (IK) with speed a^(mn/(m + n)) and rate 
function given by 

lF,G{a,l3) = Y^Ipia) + jW). 

Note that {lF,Gia,P) < 00} C BV{R) x BV{R) and (F^,G„) G BVi{R) x 
BVi{R). For each M > 1, we consider the map $:i:)(M) x BVm{R) ^ R 
defined as 



^A,B)= [ A{s)dB{s). 
Jr 
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Then = / FmdGn, and by Lemma 3.9.17 of van der Vaart and 

Wellner (1996), ^> is Hadamard differentiable at each {A, B)€V^ = {/ \dA\ < 
oo} and the derivative is given by 

^AB(a,/3)= / A{s)dm+ [ a{s)dB{s), 
Jr Jr 

where J^^ A(s) df3{s) is defined via integration by parts if /3 is not of 
bounded variation; that is, 

A{s) dp{s) = A{b)p{h) - A{a)P{a) - j P{s-)A{s). 

a,b] J (a, 6] 

Thus, by Theorem 3.1 with Vq = {{a, f5);lF{(y) < oo,/g(/?) < oo}, we con- 
clude that 



Fm.dG„ - / FdG ,n>l 



■sj mn/[m + n) 
a{mn/{m + n)) 

satisfies the LDP on M with speed a^{mn/ {m + n)) and rate function given by 
lW{x) = \nii^-^lF{a) + ^lG{P), j F{s)d(3{s) + j a{s)dG{s)=x 

j Fji'adG- j GapdF = x,a<.F, 
/3<^G, lim |a(t)|=0, lim |/3(t)|=0 



X^ 



2(AVar(F(y)) + (l- A)Var(G(X)))' □ 

4.2. Moderate deviations for Kaplan-Meier estimator. Let X and C be 
independent, nonnegative random variables with distribution functions F 
and G. Let Xi, . . . ,Xn be i.i.d. random variables distributed according to 
the distribution function F and let Ci, . . . , C„ be i.i.d. random variables dis- 
tributed according to the distribution function G. Xi, . . . , Xn and Ci, . . . , C„ 
are assumed to be independent. Observed data are the pairs (Zi, Ai), . . . , (Z„, 
A„), where Zj = XiAGi, and = l{Xi<Ci}- The cumulative hazard function 
is defined by 

(4.3) Ait) = [ J- dF{s) = [ J- dH-^is), 

J[o,t] F{s) J[Q,t] H{s) 
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where F{t) = P{X > t) and H{t) = P{Z > t) are (left-continuous) survival 
distributions, and H^^^{t) = P{Z < t, A = 1) is a subdistribution function of 
the uncensored observations, where A = l{x<c}- We also denote H'^{t) = 
P(Z < t, A = 0). The Nelson-Aalen estimator is defined by 

(4.4) An{t)= f =^dHl'^{s), 



[0,t] Hn{s) 

where 

(4.5) ^rW = -EWA=i} Hn{t) = -Y,hz.>t} 

i=l 1=1 

are the empirical subdistribution functions of the uncensored failure time 
and the survival function of the observation times, respectively. 
The distribution function F{t) can be rewritten as 

^-m= n (i-'^A(s)). 

0<s<t 

The Kaplan-Meier estimator Fn{t) for the distribution function F{t) is de- 
fined by 

(4.6) l-Fn{t)= W {l-dKn{s)). 

0<s<t 

The Kaplan-Meier estimator F^ is the nonparametric maximum likelihood 
estimator of F in the right censored data model, proposed by Kaplan and 
Meier (1958). Dinwoodie (1993) studied large deviations for censored data 
and established a large deviation principle for sup^-g,- \Fn{x) — F{x)\ where 
T is a fixed time satisfying {1 — F{t)}{1 — G(r)} > 0. Bitouze, Laurent 
and Massart (1999) obtained an exponential inequality for sup^g^Kl — 
G{x))\Fn{x) — F{x)\}. Wellner (2007) provided a bound for the constant 
in the inequality. In this subsection, we establish its moderate deviation 
principle. 



Theorem 4.2. LetT>0 such that H{t) < 1. Then {^(A„ - A),n > 1} 
satisfies the LDP in D[0,r] with speed a^lji) and rate function given by 

I^(<A) = inf|/^,G(a,/3); / ^da{s)-[ ^^dH^'{s) = m, 
(4.7) 

for any f S [0, r] > , 
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where 

1 



(4.8) lF,Gi(^,/3) = { 



2 



a'^.c(u)|2(iii'"^(^z)+ / |(a + /3)'^.(n)|2d/7^(^z) , 



ifa<^ F"^ a + P^.H" and lim = 0, 

s>oo 



oo, otherwise. 



Proof. The pair {H^^,Hn) can be identified with the empirical distri- 
bution of the observations indexed by the functions = {I{z<t,A=i}Tt S M} 
and §2 = {I{z>t}Tt G K}. It is easy to verify that the two classes 5i and ^2 
are Donsker classes and the mapping : l^o (5^^ 1— >• ^oo (5^1 ) x ^00 (5^2 ) defined by 
(j) — > , (^Isa) is continuous, where 5^ = IJ^-^^^^j- Applying Theorem 2 of 
Wu (1994) to Ln = ^ X^iLi (^{Zj.A,) and ^, and the classical contraction prin- 
ciple [see Dembo and Zeitouni (1998), Theorem 4.2.1] to ^, we can get that 

{H:i'^-H^^,Hn-H),n>l 



a{n) 

satisfies the LDP on D([0,r]) x D([0,r]) with speed a^{n) and rate function 
I^,G(a,/3)=inf|V/'72(n)dF-(n)+ / ^l{u) dH' {u)\, 



71 (n) dH^'^u) + j 7o(n) dF^(n) = 0, 
and for any t G [0,oo), / 7i(n) d//"^(ii) =a(t). 



71 (n) dH^'{u) + / 7o(t^) tZ^'(t^) = m 

[t,oo) J [t,oo) 



a'Hu.{u)\'' dH'^^u) + j \{a + fi)'^^{u)\''dH'{u) 
if a < i/"^ a + /3 <C ii""^ and lim |/3(t)| = 0, 



00, otherwise. 

Set P$ = {(A,-B) G SFi([0,r]) x L>([0, r]); 5 > (r)/2}. By the Dvoretzky- 
Kiefer-Wolfowitz inequality [cf. Massart (1990)], for any e > 0, 

P{ sup \Hn{t) - H{t)\ >e]< 2exp{-27ie2}. 

V6[0,t] ^ 

In particular, take e = H(t)/2, then we have 

lim sup log P* ((i^r, ;H^„,) ^ P$) 
(4.9) ^ 



< lim sup ^— logP*iHn{t) < H{t)/2) = -00. 
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Consider the maps <I>i:V^cBVi{[0,t])xD{[0,t])^BVi{[0,t])xD{[0,t]) 
and ^2-BV{[0,t])xD{[0,t])^D{[0,t]) defined as 

^i{A,B) = {A,l/B) and <^2-{A,B)^ [ BdA. 

Jio,-] 

Define = ^>2(^>i(A,S)). Then ^>(i?«^;ff„) =A„, = A and 

by Lemma 3.9.17 of van der Vaart and Wellner (1996), $ is Hadamard 
difFerentiable at each {A,B) G The derivative is given by 

J[o,t] B{s) J[Q t] B\s) 

Applying Theorem 3.1 to 17„ = {(i?^^F„) G P„(-) = P(-|J1„) and Vq = 
together with (4.9), we conclude that {^^^(A.„ — A),n > 1} satisfies the 
LDP in L'[0,t] with speed a^(n) and rate function given by 

/^(0) = inf|/^,G(«,/?); / ^da{s)-f l^dH^%s) = m, 
{ 7[o,t] H{s) J[o,t] H (s) 

for any t G [0,r]|. ^ 

Next, we give some other representations. Let {{G^'^{t),G(t)),t G [0,r]} 
be a zero-mean Gaussian process with covariance structure 

E(G"^(s)G""(t)) = F"^(s At) - H'^^{s)H'^^{t), 
E(G{s)G{t)) = H{s V t) - H{s)H{t), 

and 

EiG^'^{s)Git)) = (i/-(s) - i/-(t-))/(_„o,,](t) - H^'^{s)H{t). 

Set r = {(i,t),j = l,2,tG [0,t]} and 

Z = {Z^^^^yJ = l,2,t G [0,r],Z(i,,) = G-(0,%t) = G(t)}. 

Then by Theorem 5.2 of Arcones (2004), {{%t)/y^A(n), {j,t) ef},n> 1} 
satisfies LDP on loo{T) with speed A(n) and rate function given by 

i{x) = inf{i^(72); 7 G £, ^(7%,t)) = a^o,*) for ah (j,t) G f }, 

where £ is the closed vector space of Lp'{P) generated by {Zj-j^^), (j,t) G T}. 
Since the mapping ^:/oo(^) ^ ^oo([0,t],M^) defined by 

{<l^m , (i, t) G T} ^ { (</.(i,t) , <t>i2,t) ) , i G [0, r]} 
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is continuous, then by the classical contraction principle [see Dembo and 
Zeitouni (1998), Theorem 4.2.1], we know that {-^i=(G"'=, G), n > 1} sat- 
isfies the LDP on L)([0,r]) x D([0,r]) with speed A(n) and rate function 
lF,G{oi,f3), where A(n) — )■ oo as ?i— )-oo. 

Define M"'=(t) = G"^(t) - /^^^j G{u) dA{u) and 

(4.10) Z{t) = [ J- dG-^is) - [ ^ dH-^{s), 

where the first term on the right-hand side is to be understood via integration 
by parts. Then M"*^ is a zero-mean Gaussian martingale with covariance 
function [van der Vaart and Wellner (1996), page 384] 

£;(M"^(s)M"^(t)) = / H{u){l-AA{u))dA{u), 

where AA(u) = A(u) — A{u—) and Z{t) = =^ dM^'^{s) is a zero-mean 
Gaussian process with covariance function 

E{Z{s)Zit))= [ l^^dA(n). 

J[0,sAt] ti{u) 

Therefore, by Theorem 2.1, we conclude that {{Z{t)/ ^ A(n), t G [0, r]}, n > 1} 
satisfies the LDP on D([0,r]) with speed A(n) and rate function I^{cj)). Fur- 
thermore, from Theorem 5.2 of Arcones (2004) and Theorem 3.1 of Arcones 
(2003b), we have the following result. 

Theorem 4.3. Let t > such that H{t) < 1. T/ien {^(A„- A),n>l} 
satisfies the LDP in D[0,t] with speed a'^{n) and rate function L^ given by 

{m 
4 = 1 

^ m 

(4.11) " 2 ^ 

k,j=i 

[0,tf,At,] H{u) J 
In particular, for any r > 0, 

1 / /n \ 

(4.12) lim ^--logP sup | A„(x) - A(x)| > r 



where n\ = /^^^j dK{u). 
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Now we present the moderate deviations for the Kaplan-Meier estimator 

Fn{t). 

Theorem 4.4. Letr > Q such that H{t) <l. Then {-^{Fn - F),n> 1} 
satisfies the LDP in D[0, r] with speed a'^{n) and rate function given by 



m 

/^^^((/.)= sup sup {^(l^ua, 

m>l,ti,...,tm€[0,T] ai,...,QmGM [ -^-^ 



k,j=l 

(4.13) 



2 ^ OfcOj 



(l-AAM)Jf(u) 'J 



In particular, for any r > 0, 



(4-14) Jim^-^logpf^^ snv\F^{x)-F{x)\>r)=--^, 



n->oo n 



'^j x6[0,t] / 2f7 



where 



a']^M= sup (l-F(t))^ / i — = — dMu). 

Proof. The map $ : BV[0, r] C D[0, t] ^ D[0, r] is defined as 
HA){t)= n {l + dA{s)). 

0<s<t 

Then, 1 - F{x) = ^>(-A)(x) and 1 - Fnix) = ^>(-A„)(x). Since H{t) < 1, 
there exists some M G (0,oo) such that A G BVm[0,t]. From (4.12), we have 

linisup^^logP*(A„ ^ ByM+i[0,T]) 
n— 5>oo a [n) 

< hm ^ logP( sup |A„(x) — A(a;)| > l) = — oo. 

By Lemma 3.9.30 of van der Vaart and Wellner (1996), we know that <I> is 
Hadamard differentiable in BVm+i[0,t] with derivative 

^'^{a){t) = [ '^{A){0,u)^{A){u,t]da{u), 
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where ^{A){u,t] = n«<s<i(l + '^^(^))- Applying Theorem 3.1 to Qn = {An S 
BVm+i[0, r]}, Pn{-) = P{-\^n) and Vq = BVm+i[0, t], we obtain from Theo- 
rem 4.2 that {^{Fn - F),n> 1} satisfies the LDP in D[0,t] with speed 
o?{n) and rate function J-^*^ given by 

I^^=inf|/^(a); 

/ $(F)(0,n)$(F)(n,t]da(n) =(/)(t), for any t G [0, r] I . 
J{o,{\ J 

On the other hand, we consider the process <J>'_^(Z)(t), where Z is defined 
by (4.10). Since 

^~A{Z){t)= ^q—^ dZ{u) 

J{o,t] 1 - F{u) 

= {l-F{t))[ ^ dZ{n), 

J{o,t] 1 - AA(u) 

which is a zero-mean Gaussian process with covariance function 

(1 - F{s)){l - F{t)) [ ] dA{u), 

J{o,sAt] (1 - AA{u))H{u) 

then, by Theorem 5.2 of Arcones (2004) and Theorem 3.1 of Arcones (2003b), 
we obtain the conclusion of the theorem. □ 

4.3. Moderate deviations for the empirical quantile processes. For a non- 
decreasing function G G D[a, b] and any p e M, define G~^{p) = inf{x; G{x) > 
p}. Let Di[a,b] denote the set of ah restrictions of distribution functions on 
M to [a, b] and let D2[a, b] denote the set of distribution functions of measures 
that concentrate on {a,b]. 

Theorem 4.5. Let < p < q < 1 be fixed and let F be a distribution 
function with continuous and positive derivative f on the interval [F^^{p) — 
e, F~^ (q) + e] for some e > 0. Let Fn be the empirical distribution function of 

an i.i.d. sample Xi, . . . , of size n from F . Then {-^^{F~^ — F~^),n > 1} 

satisfies the LDP in loo\p, q] with speed a^{n) and rate function I^^ given by 

I''Q{4>) = inf|/^(a); -^||^^|f|y = ^ for all x G [p, 



where 




, , , , \ap{x)\^ dFix), ifa<^F and hm |a(t)|=0, 

lF\Oi) = \ 2 J \t\^oo 

otherwise. 
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Proof. Applying Theorem 2 of Wu (1994) to L„ = IYa=i^x,, and 
5 = {(— oo, x]; X G M}, we know that {^^{Fn — F),n > 1} satisfies the LDP 
on L'(M) with speed a^(n) and rate function Ip. By Lemma 3.9.23 of van 
der Vaart and Wellner (1996), it follows that the inverse map $ : G i— ?■ 
as a map Di[F~^{p) — e,F~^{q) +e]i-^ Ioo[p,q] is Hadamard differentiate 
at F tangentially to C[F~^{p) — e,F~^{q) + e], and the derivative is the 
map a I— 7- —a{F~^)/f{F~^). Therefore, by Theorem 3.1, we conclude that 

{^^(F~^ — F~^),n > 1} satisfies the LDP in loo[p,q] with speed a^(n) and 
the rate function I^^ . □ 

4.4. Moderate deviations for the empirical copula processes. Let BV^{M.'^) 
denote the space of bivariate distribution functions on M^. For H G BV^{M?), 
set F{x) = H{x, oo) and G{y) = H{oo, y). 

Let (Xi, li), . . . , (X„, 1^) be i.i.d. vectors with distribution function H. 
The empirical estimator for the copula function C{u, v) = H{F~^{u),G~^{v)) 
is defined by Cn{u,v) = Hn{F~^ (u) , {v)) , where Hn, Fn and Gn are the 
joint and marginal empirical distributions of the observations. 

Theorem 4.6. Let 0<p<g<l he fixed. Suppose that F and G are con- 
tinuously differentiable on the intervals [F~^{p) — e,F~^(q)+e] and [G~^(j)) — 
e, G~^ (q) -\- e] with strictly positive derivatives f and g, respectively, for some 
e > 0. Furthermore, assume that dH/dx and dH/dy exist and are continu- 
ous on the product intervals. Then {-^^{Cn — C),n > 1} satisfies the LDP 
in Zoo([p, 5]^) with speed a^in) and rate function L^ defined by 
L^{4>)=ini{LH{a)-<^'H{a) = 4>], 

where 

^'M{u,v)=a{F-Hu),G-\v))-^{F-\n),G-Hv)f^^~'^''^''^^ 



dx' ' " ' /(F-i(n)) 

-^(F (-)'G (-)) • 

Proof. By Theorem 2 of Wu (1994), we know that 

'^^{Xk,Yk)((-^^^] ^ (-00,2/]) - H{x,y) G • 



n 



satisfies the LDP on D{E?) with speed a^(n) and rate function defined as 
///(a) = inf|^ j j^{x,y)H{dx,dy); a{s,t) = j -f{x,y)Li^^^s^y^tyH{dx,dy) 

for each (s, t) G ]R^ and j jdH = o| 
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- [ {a'jj)'^{x,y)H{dx,dy), if a ^ and lim \a{s,t)\=0, 

2 J \s\,\t\^oo 

oo, otherwise. 

Then, by Lemma 3.9.28 of van der Vaart and Wellner (1996), we con- 
clude that the map ^ : H ^ H{F-\G-^) as a map BV^iR"^) C ^(l^) ^ 
Ioo{\p,q]'^) is Hadamard differentiable at H tangentially to C(M ), and the 
derivative is Therefore, it follows from Theorem 3.1 that, {-^^{Cn ~C), 

n > 1} satisfies the LDP in loo{[p, q]"^) with speed a^(n) and rate function l'^ 
as defined in the theorem. □ 

4.5. Moderate deviations for M -estimators. M-estimators were first in- 
troduced by Huber (1964). Let X be a random variable taking its values 
in a measurable space {S,S) with distribution F, let Xi, . . . ,Xn be a ran- 
dom sample of X, and let Fn denote the empirical distribution function 
of X. Let be a Borel subset of M*^. A M-estimator 0„(Xi, . . . , X„) over 
the function 5 is a solution of 



g{x,9n) dFn{x) = iiii / g{x,9) dFn{x). 
eee j 

If g{x, 9) is differentiable with respect to 9, then the M-estimator 9n{Xi, . . . , 
Xn) may be defined as a solution of the equation 

Vgg{x,9n)dFnix)=0, 

where Vgg{x, 9) = (^^p^, . . . , ^^ggf^ ) • The detailed description on Af -estima- 
tors can be found in Serfiing (1980). 

Jureckova, Kallenberg and Veraverbeke (1988), Arcones (2002) and Inglot 
and Kallenberg (2003) studied moderate deviations for M-estimators. In 
this subsection, we study the problem by the delta method. Let jp{x,9) = 
{ip^{x,9), . . . ,iIj'^{x,9)) : S X 01— )']R'^. We also need the following conditions. 

(CI) 'ijj{x,9) is continuous in 9 for each x £ S, and ip{x,9) is measurable 
in X for each 9 €Q. 

Define 



^{9) = i^\9),..., ^^(^)) = E{iPiX, 9)) = J ^(x, 9) dF{x), 9 G G, 
and 

^,,{9) = {^i{9),...Mnm = -Y.^{X^,9) = J i;ix,9)dFnix), 9ee. 



i=l 
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(C2) ^ has a unique zero at Oq; there exists some r] > such that B{9o,7]) := 
{6 G '-,19 — 9q\ < r]} C Q and ^' is homeomorphism on B{9Q,ri); ^' is differen- 
tiable at with nonsing ular derivative A-.R"^ ^R"^; and E{\ip{X,9)\'^) < oo. 

Let C{B{9(),r])) denote the space of continuous R^'-valued functions on 
B{9o,7]) and define ||/|| = sup,g5(,,^ |/(0)| for / G C(Bi9o,7])). Let *o(^) 

and ^'on be the restrictions of ^ and ^„ on B{9o,r]), respectively. 
(C3) {a(n),n> 1} satisfies 

(4.15) a(n) /'oo and -^\0 

and > 1} satisfies 



(4.16) 4^ sup |*„(^)-*(0)| Ao 
and 

(4.17) hmsup^^logfnPf sup |V'(^, 6l)| > Vna(n)') ) 



-oo. 



Remark 4.1. Let y be a random variable taking its values in a Banach 
space and E(Y) = 0. If there exists a sequence of increasing nonnegative 
functions {Hf;,k > 1} on (0, +00) satisfying 

(4.18) lim u-^Hk{u) = +00, lim lim log ^k{Vna{n)) ^ 

u^oo fc-s.oo n-5>oo (n) n 

and 

(4.19) E{Hk{\\Y\\)) < 00 for any k>l, 
then 

(4.20) limsup^^log(nP(||y|| > ^/^a{n))) = -00. 

In particular [cf. Chen (1991), Ledoux (1992)], if for each A; > 1, 

i?(||yf(iog||y||)'=)<+oo. 



then (4.20) holds for a(n) = ^log logn; if for each A; > 1, 

s(||yf)<+oo, 

then (4.20) holds for a(n) = ylogn; if for some 1 < p < 2, there exists some 
6 > such that 

(4.21) E{exp{6\\Yf-P}) < +00, 
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then (4.20) holds for a{n) = o{n^'^ p)/'^py^ if for some 1 <p < 2, and 
(4.22) E{exp{6\\Yf'P}) < +oo for ah S>0, 

then (4.20) holds for a(n) = 0(n(2-p)/2p). 



In fact, by Chebychev's inequality, 

P(||y|| > ^/^a{n)) < 



E{Hk{\\Y\ 



Hk{y/na{n)) ' 
Hence, (4.18) and (4.19) yield (4.20). 

Lemma 4.1 [See Lemma 4.3 in Heesterman and Gill (1992)]. Assume 
that (CI) and (C2) hold. Then there exists a neighborhood V of in 
C(B{9o, 7])) and a functional $ : C(S(6'o, r])) ^ B{eo,r]) such that /($(/)) = 0, 
for any f £ V, and $ is Hadamard dijferentiable at ^'^ with derivative 
$'^^(/) = -^-1/(^0). 

Theorem 4.7. Suppose that (CI), (C2) and (C3) hold. Define 

(4.23) 0„ = ^.(^on). 

Then {^^(^n — ^o)) ^ ^ 1} satisfies the LDP with speed a^{n) and rate func- 
tion 

(4.24) I^\z) = ^{Az,r-^Az), 
where T is the covariance of iP{X,9q) — ^{9q), and 

(4.25) limsup^^logP(^'„(^„) /O) = -oo. 

a^(n) 



n—^oo 



Proof. Set r = {l,...,4 x B{9o,r]). Since 

TxT3 ((i, s), ij,t)) ^ d{{i, s), {j,t)) := (Var(V^^(Xi, t) - iXi,s))f^ 

is continuous on T x T and d{{i,t), {i,t)) = 0, then {T,d) is totally bounded. 
Hence, under (C3), Theorem 2.8 in Arcones (2003a) yields that 

{{^i%ni^) - '^KO)),ii,0) G T},n > 1} satisfies the LDP in 1^{T) with 
speed a'^{n) and rate function 

/(/) = i inf{i^(a2(X)); /(z, 9) = E{a{X){^\X, 9) - ^\9)))] 

satisfying 

limsup|inf{/(/);||/||>A} = -oo. 
A-5>oo A 

Then, applying the classical contraction principle [see Dembo and Zeitouni 
(1998), Theorem 4.2.1] to l^{f) 3f^ (/(1, 9),..., fid, 9)) G loo(B{9o, v), 
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we obtain that {{^{'^Qni9)-^i9)),9 £ B{eo,'n)},n>l} satisfies the LDP 
in C{B{9Q,ri)) with speed a^(n) and rate function 

/(/) = lmf{Eia\X));fi9)=E{aiX)ii;{X,9) - $(0)))}. 
Therefore, we have 

hmsup ^] logP(^on ^y) = -oo, 

and so (4.25) holds. Then, by Theorem 3.1, we conclude that {-^^{(^n — 
n > 1} satisfies the LDP with speed a^{n) and rate function 

I^{z) = \ \ni{E{a\X)),E{a{X){ij{X, 9^) - $(^o))) = -Az} 

= \{Az,T-^Az). □ 

Remark 4.2. Comparing with Theorem 2.8 in Arcones (2002), in The- 
orem 4.7, we remove the condition 

limsup ^ logPdgn - 9q\ > e) = -oo, 

n^oo d [n) 

which is required by Arcones (2002). 

4.6. Moderate deviations for L-statistics. Let Xin < X2n < • • • < Xnn be 
the order statistics of a random sample Xi , Xn from a random variable X 
with distribution function F{x) and let J be a fixed score function on (0, 1). 
Also let Fn be the empirical distribution function of the sample. We consider 
the L-statistics of the form 

" ri/n fl 

Ln:=y^Xin J{u)du= F~\s)J{s)ds. 

j=l J{i-l)/n Jo 

Groeneboom, OosterhofF and Ruymgaart (1979) had obtained some large 
deviations for L-statistics. The Cramer type moderate deviations for L- 
statistics had been studied in Vandemaele and Veraverbeke (1982), Bentkus 
and Zitikis (1990) and Aleskeviciene (1991). In this subsection, we study the 
moderate deviation principle for L-statistics by the delta method. 

Take X = loo{^) and J' = M. Let P$ be the set of all distribution functions 
on M, and set Vq = {a{G - F); G G a G M}. Define ^> : P$ R as follows: 



xJ{G{x))dG{x). 



$(G)= / G-\s)J{s)ds= [ 
Jo J-i 

Assume that EiX"^) < oo. Set m{J,F) = J^^xJ{F{x))dF{x), and 
a^{J,F)= [ J{F{x))J{F{y)){F{x Ay) - F{x)F{y)) dxdy, 
where x Ay = min{x, y}. We also assume a'^{J, F) > 0. 
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Theorem 4.8. Suppose that the score function J is trimmed near and 
1, that is, J{u) = 0, li € [0, ti) U (t2, 1] where <ti <t2 <l. If J is bounded 
and continuous a.e. Lebesgue measure and a.e. , then {-^^{Ln — m{J, F)), 
n > 1} satisfies the LDP in M with speed a'^{n) and rate function I^{x) = 



Proof. By Theorem 1 in Boos (1979), we have 

\<^{G) - ^{F) - f{F{x) - G{x))J{F{x)) dx\ 
||G-F||-5>0 ||G-F|| 

Therefore, for any t„ — )■ 0+ and Hn — )• a G Pq with F + tnHn S T)^, 

mF + tnHn)-HF) 



hm 

n— >oo 



+ j Hn{x)J{F{x))dx 



0, 



and so, : P$ i— t- M is Hadamard-differentiable at F tangentially to T)q with 
respect to the uniform convergence, and ^'p{oi) = — J^a{x)J{F{x))dx,a G 

Pq- By Theorem 3.1, we conclude that {-^^^{Ln — m{J, F)),n > 1} satisfies 

the LDP in M with speed a^(n) and rate function given by 

I^(y) = inf I (a); - / a{x)JiF{x))dx = y], 



which equals the rate function of { — -^^ /K(-^n(^) ~ F{x))J{F{x)) dx, n > 1}, 
that is, I^{y) = 2a%j,F) - ° 

Now, let us remove the trimming restrictions on J. Set 

= |g(x) = G(x)/(_^,o)(x) + (G(x) - l)Iio,oo)i^y, 



GeV^, j \x\dG{x) <oo| 



and Vo = {a{G - F) = a{G - F);a G ]R,G G Then P$,Po C 

Define 4>:P$h^M by l>(G') = «>(G) for all GgP$. 

Lemma 4.2. //J zs Lipschitz continuous on [0,1], i/ien <^> : 2?$ i— t- M is 
Hadamard-differentiable at F tangentially to Vq with respect to -convergen- 
ce, and 

^'-{a) = - a{x)J{F{x))dx, aGPo- 
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Proof. By integration by parts, we can write [cf. Boos (1979), Shao 
(1989)] 



$(G)-$(F)+ {G{x)-F{x))J{F{x))dx = R{G,F) foranyGGP$, 
Jr 

where R{G,F) = f^WG,F{x){G{x) - F{x))dx, and 
I^l:^{J{t)-J{F{x)))dt 



0, 



G{x) - F{x) 



if G(a;) /F(x), 
ifG{x) = F{x). 



By the Lipschitz continuity of J, there exists a constant C > such that 



\R{G,F)\<C / {G{x)-F{x)ydx = G / {G{x) - F{x)y dx. 
Jr Jr 

For any i„ 0+ and F„ q G in (-^H^^); II ' IIlO with F + tnHn G 
then < 2/t„ and 

/ |i/„(x) -a(x)|2(ix< (||a|| +2/t„) / \Hn{x)-a{x)\dx, 
JR Jr 

where ||a|| = sup^-gig |q(x)|. Therefore, 



- / {F{x)+tnHn{x)-F{x)Ydx 
'"a 



and so 



hm 



<2tn i \Hn{x) - a{x)\^ dx + 2tn \a{x)\'^ dx 0, 
Jr Jr 

^iF + tnHn)-^{F) 



+ / Hn{x)J{F{x))dx 



which yields that $ is Hadamard-differentiable at F tangentiahy to T>o with 
respect to L-'^-convergence, and ^'p{a) = — J^a{x)J{F{x)) dx. □ 

Lemma 4.3. Let X be a random variable with values in a separable 
Banach space B and E'dlXp) <oo. Then (B^,d) is totally bounded, where 
B| is the unit ball of the dual space B* o/B, and 

dig, h) = {E{{g{X - E{X)) - h{X - E{X))f)f\ <7, G B^ 

Proof. Noting \g{X - E{X)) - h{X - E{X))\ < 2\\X - E{X)\\ for ah 
G B^ and -£^(11^ — ii^(X)|p) < oo, by the dominated convergence theo- 
rem, we know that the function [g, h) i— d{g, h) is continuous from B^ x B^ 
to M with respect to t(;*-topology. Let d* denote a compatible metric on 
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(Bl^w*). Since is w*-compact and d{g,g) = 0, then, for any e > 0, 
there exists some 5 > such that d{g,h) < e, if d*{g,h) < 6. Choose finite 
points G B| such that S| C \J'i^i{g;d* {g,hi) < 5}, then C 

\J^i{g',d{g,hi) <e}. Therefore, (Bl,d) is totahy bounded. □ 

Define 

A2,i(X)= / y/p{\X\>t)dt. 
Jo 

Then [cf. del Barrio, Gine and Matran (1999), page 1014], A2,i{X) < oo if 
and only if y^F(x)(1 — F{x)) dx < oo. 

Lemma 4.4. Assume that A2,i(X) < oo. // (4-15) holds and 

(4.26) limsup ^ log(nP(|X| > y/na{n))) = — oo, 

n->-oo a (n) 

*^eri {^(F„ -F) = ^(F„ -F),n > 1} C Po satisfies the LDP m (L^R), 
II ■ IIli) with speed a^{n) and rate function Ip. 

Proof. Set = I{x,<x} - F{x),x G M, then 

\U\L.=2(xiF{Xi)- [ xdF{x)]. 

V J(-oo,Xi) J 

Therefore, the condition of the lemma implies 

limsup^^log(nP(||^i||ii > ^ain))) = -oo, 



and by Theorem 2.1(b) of del Barrio, Gine and Matran (1999), we also have 
ll(n)^ X]r=i?«IUi — ^ 0. By Lemma 4.3, (B^,d) is totally bounded, where 

B^ := \ g^V^\\g\oo := esssup|5(x)| < \ \ 

and 

2\ \ 1/2 



{g(x) - h{x))ix{x)dx 



Therefore, by Theorem 2.8 in Arcones (2003a), the conclusion of the lemma 
holds. □ 



By Lemmas 4.4 and 4.2 and Theorem 3.1, we obtain the following result. 
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Theorem 4.9. Assume that A2,i(X) <oo, (4-15) and (4-26) hold. If J 
is Lipschitz continuous on [0,1], then {-^^{Ln — 'm{J,F)),n > 1} satisfies 
the LDP in M with speed a?{n) and rate function I^{x) = 2a'^(j F) ' 



Remark 4.3. From Remark 4.1, the moment condition in Theorem 4.9 
is weaker than the conditions given in Vandemaele and Veraverbeke (1982), 
Bentkus and Zitikis (1990) and Aleskeviciene (1991). In particular, if 
S(|X|2+^) < oo and a(n) = yToglogn, then the condition of Lemma 4.4 
is vahd, and so, for any r > 0, 



lim -, — -, 

n-s>oo log log n 



logP 



n 



log log n 



\Ln-miJ,F)\>r 



2(j2(J,F)' 



5. Application: Statistical hypothesis testing. In this section, we applied 
the moderate deviations to hypothesis testing problems. We only consider 
the right-censored data model. The method can be applied to other models. 

Let F be the unknown distribution function in the right-censored data 
model considered in Section 4.2 and let Fn be the Kaplan-Meier estimator 
of F. Consider the following hypothesis testing: 

Ho:F = Fo and Hi:F = Fi, 

where Fq and Fi are two distribution functions such that Fq{xq) ^ Fi{xq) 
for some xq G [0,r]. Similar to the Kolmogorov-Smirnov test, we take the 
Kaplan-Meier statistic r„ := sup^gjo.r] \ Pn{x) — -^0(2;)! as test statistic. Sup- 
pose that the rejection region for testing the null hypothesis Hq against Hi 
is {-^jTn > c}, where c is a positive constant. Then the probability an of 
Type I error and the probability of Type II error are 



On =P 



n 



respectively. Then 

^n<P 



Tn>C 



n 



n 



F = Fo and /3„ = P^r„ < c 

a[n) 



sup \Fn{x) - Fi{x) 



F = F, 



n 



> — - sup \Fo{x) - Fi{x)\ - c 



F = Fi 



Therefore, Theorem 4.4 implies that 



log On 



log/3„ 



-00, 
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where 

<^KM= sup {l-Fo{t)f I . . , . dAo(n), 

te[o,r] 7(0,4] (1 - AA{u))Ho{u) 

^o(0=/ ^-^dFo(s), Ho{t) = P{Z>t\F = Fo), 

J[o,t] l-Fo{s-) 

and Z is as defined in Section 4.2. 

The above resuh tells us that if the rejection region for the test is {^^^n ^ 
c}, then the probability of Type I error tends to with decay speed 

exp{-c^a'^{n)/{2aj^M)}, 

and the probability of Type II error tends to with decay speed exp{— ra^(n)} 
for all r > 0. 

6. Concluding remarks. This article discussed the large deviations of 
transformed statistics. For the problem, an extended contraction principle 
was developed and a general delta method in large deviation theory was 
proposed. The new method was used to establish the moderate deviation 
principles for the Wilcoxon statistic, the Kaplan-Meier estimator, the em- 
pirical quantile estimator and the empirical copula estimator, which have 
not been addressed in the literature. The proposed method was also used 
to improve the existing moderate deviation results for M-estimators and L- 
statistics, where our proofs are different from others but simpler by the new 
method. Moreover, our moderate deviation results are very useful for statis- 
tical hypothesis testing. As shown in Section 5, a moderate deviation result 
can be used to construct a test of a statistical hypothesis such that the prob- 
abilities of both Type I and Type II errors tend to with an exponentially 
decay speed as n — )• oo. 

Note that the asymptotics for multivariate trimming and general Z-estima- 
tors have been studied by using the delta method in a weak convergence; 
see Nolan (1992) and van der Vaart and Wellner (1996). Similar to those 
presented in Section 4, the moderate deviations for these estimators can be 
established by using the proposed delta method in large deviations. 

These applications show that the proposed method is very powerful for 
deriving moderate deviation principles on estimators. The method will play 
an important role in large sample theory of statistics like the functional 
delta method in weak convergence. Theoretically speaking, we can apply 
the proposed delta method to obtain moderate deviations for estimators 
where the classical delta method can be applied. 
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APPENDIX: PROOF OF THE EXTENDED CONTRACTION 

PRINCIPLE 

Step 1. First of all, let us prove {/ < 00} C Poo, where V^o denotes the 
set of all X for which there exists a sequence with Xn E and 

In fact, by the definition of Poo, x S Doo if and only if for any k>l, there 
exists a positive integer Uk such that Bd{x, 1/A;) n P„ 7^ for all n > Uk, 
where Bd{x, 1/k) = {y £ X\d{y,x) < Therefore, for any x ^ Voo, there 

exist an open neighborhood U olx and a subsequence {X^n^ , A; > 1} such that 
P^j. DU = for all A; > 1. Then by the lower bound of the large deviations 
for {Xn,n > 1}, we have 

-00 = lim inf — ^ log Pn,. ^ {Xn„ &U)> -I{x) , 

which implies {/ < 00} C ^^oo) where -Pn^^, is the inner measure correspon- 
ding to Prij. as defined in Section 2. 

Step 2. Let us prove that if some subsequence x^^, — )• x G {/ < 00} with 
Xnj. G Pnfe, then fn^i^nk) " ^ and the restriction of the function / to 
{/ < 00} is continuous. 

The proof is similar to that of the extended mapping theorem [see Theo- 
rem 1.11.1 in van der Vaart and Wellner (1996)], which is given below. Let 
a subsequence — )■ x G {/ < 00} be given. Since x £ Poo, there exists a se- 
quence X with yn G Vn for each n > 1. Define x„ = x„/|„^. fc>i}(n) + 
yn/{„j.^fc>i}c(n). Then x„ G fn for each n > 1 and x„ — ?• x. Therefore, by 
condition (ii), /n(x„) — )• /(x), and so fuki^rik) — ^ To prove the con- 
tinuity of / on {/ < 00}, let Xm — )• X in {/ < 00}. For every m, there is 
a sequence Xm,n E "Z^n with Xm,n Xm as 00. Since Xm £ {I < 00}, then 
fn{xm,n) " ^ f{xm) as n — )• OO. For every m, take rim such that is increas- 
ing with m satisfying d{xm,n,^,Xm) < l/m and (x„,,„„J, /(x^)) < 1/m. 
Then Xm,nm ~^ x, and by the first conclusion in Step 2, fnm{xm,n„J ~^ fix) 
as m— 00. This yields f{xm) — >• 

Step 3. Let us prove that [If < L\ = f{[I < L\) for any L > and If is 
inf-compact, that is, for any L G [0,+oo), [If < L] is compact. This can be 
shown by the continuity of /|{/<oo} obtained in Step 2. 

Step 4- Next, we show the upper bound of large deviations. 

Let be a closed subset in y. Then, using the arguments similar to the 
proof of the extended continuous mapping theorem [see Theorem 1.11.1 in 
van der Vaart and Wellner (1996)], 

00 00 

(A.l) n U fmHF) C rHF) U ({/ < oo})^ 

n=l m=n 
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Now for every fixed k, by the large deviation principle of > 1}, for 

each L > 0, there exists a compact subset such that for any (5 > 0, 

1 



and so 



limsup-f-logP*(X„ G {Kir) < -L, 



limsup-f-logP:(/„(X„)eF) 



< lim sup -f - log P* ( X, G M (F) I 
A(n) ^ ; 



< max< — inf I{x),—L 

where = {y;d{y,x) < 5 for some x € K^} and P* is the outer measure 
corresponding to P„ as defined in Section 2. Since Kl is compact and I is 
lower semi-continuous, then, when 6^0, 

inf I(,^)t inf I(x). 

Hence it follows that 
limsup--^logP*(/„(X„) eF) <maxl- inf J(x),-lI. 



Choose a sequence Xfc € Kl H Um=fe/m^(-^)' ^ > 1 such that I{xk) 
inf -—J — -—TT^Iix), and then choose a subsequence {xk^,m > 1} and 

xo € i^L such that Xk^ — )■ xq. Then we have 



xo G i^L n fl U /^H^) jCKLd {f-\F) U ({/ < oo})). 

\fc=l m=k / 

Letting A; — )• oo, we have 

liminf /(xfc) > /(xn) > inf I(x)'> inf /(x). 

fc^oo xei^in(/-i(-P)u({/<oo})':) xe/-i(F) 

Now letting L — )• oo, we conclude that 

limsup -f- logP„*(/„(X„) G F) < - inf /(x) = - inf /j(x). 

Step 5. Finally, we show the lower bound of large deviations: for any 
yo&y with Ifivo) < oo, 

liminf log PUfn{Xn)eB{yo, 6)) > -If{yo). 
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For any a > Ifd/o), there is some xq £ X with /(xq) = yo and I{xq) < a. 
For any 6 > 0, set B{6) = Bp{yo,6) = {y £ y;p{yo,y) < S} and = 3(5^. 
Then, by (A.l), we have 



(A.2) IJ IJ f;^MF{S)) D rHBiS)) n ({/ < oo}) 3 xo. 

n=l m=n 

Now for every fixed k, by the large deviation principle of {Xn}, we have 
liminf-i-logP„,(/„(X„) G B{6)) 

n-s>oo X[n) 



> hminf logP„, (xn G M /m'(F(<^)) ) > - mL 



I{x). 



Since xo £ rHBjS)) C U ^=i U^=„ /mH^W) , there is some k > 1 such 
that xo G U™=fc/m'(^('^))'- Therefore, 

liniinf-i-logP„,(/„(X„) e5(5)) > -I(xo) > -a. 
n->oo A(nj 

Letting a4,//(yo)i we obtain the lower bound of large deviations. 

Remark A.l. When T>n = X for ah n > 1, the continuity of / can 
be proved directly by the following property [see Theorem 2.1 in Arcones 
(2003a)]: Given e > 0, for any xo G {/ < oo}, there are 5 > and a positive 
integer no such that for all n > uq, fn{B{xQ,6)) C i?(/(xo),e). However, 
when Vn / X, fn{B{xQ,6)) is not well defined since B{xo,5) <f. T>n- Thus, 
the above property cannot be used for proving the continuity of / in this 
case. 
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